feat(vllm-tensorizer): Bump vLLM to v0.20.2 on CUDA 13.2 / Ubuntu 24.04 by JustinPerlman · Pull Request #160 · coreweave/ml-containers

JustinPerlman · 2026-05-12T19:30:22Z

Summary

Bump vLLM to v0.20.2
Add a build matrix producing two variants, both on Ubuntu 24.04 / torch 2.11.0:
- v0.20.2-cuda13.2.1-ubuntu24.04
- v0.20.2-cuda12.9.1-ubuntu24.04

Ubuntu 24.04 compatibility fixes

Remove python3-pip from apt in builder-base and add rm -f /usr/lib/python3.*/EXTERNALLY-MANAGED before pip bootstrap — on Ubuntu 24.04, apt-installed pip has no RECORD file and blocks pip self-upgrade
Purge python3-jwt in the final base stage before pip installs — same root cause: Debian-managed PyJWT has no RECORD file and blocks vLLM's dependency resolution
Fix cuda-python version spec from ~=${CUDA_VERSION} to ~=${CUDA_VERSION%.*} — patch-level CUDA versions (e.g. 13.2.1) don't match available cuda-python releases; strip to major.minor
Install wheel package in lmcache-builder and restore it to builder-base pip install

Relevant information: vllm-project/vllm@6c964bd

…-containers into jperlman/vllm0.20.2

github-actions · 2026-05-12T19:43:05Z

@JustinPerlman Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/25751418629
Image: ghcr.io/coreweave/ml-containers/vllm-tensorizer:jperlman-vllm0.20.2-cc65ad3-v0.20.2

abatilo

This seems reasonable to me but I'd feel better if @Eta0 could take a peek

JustinPerlman · 2026-05-12T19:50:40Z

This seems reasonable to me but I'd feel better if @Eta0 could take a peek

Fair enough lol

alexeldeib · 2026-05-12T19:58:53Z

Pure 13.2, no matrix with 12.9? 🫣I would really like having both options…if it’s a giant pain on vllm side it’s fine, but I think you then need to validate this actually works on b40/rtxp6000 with latest supported/installed drivers cw ships

alexeldeib · 2026-05-12T19:59:44Z

I am still not aware of a cuda + driver combo that has decent support and works as expected, but haven’t followed too closely lately

github-actions · 2026-05-15T13:20:26Z

@JustinPerlman Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/25919982852
Image: ghcr.io/coreweave/ml-containers/vllm-tensorizer:jperlman-vllm0.20.2-202ef09-v0.20.2-cuda13.2.1-ubuntu24.04

github-actions · 2026-05-15T15:39:25Z

@JustinPerlman Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/25919982852
Image: ghcr.io/coreweave/ml-containers/vllm-tensorizer:jperlman-vllm0.20.2-202ef09-v0.20.2-cuda12.9.1-ubuntu24.04

alexeldeib

seems sane

Eta0 · 2026-05-15T18:17:28Z

I'd personally suggest to not repeat yourself as much in this config file and to construct more parts of this dynamically, like the tag suffix, but not a hard requirement.

Eta0 · 2026-05-15T18:19:21Z

+    rm -f /usr/lib/python3.*/EXTERNALLY-MANAGED && \
+    python3 -m pip install -U --no-cache-dir pip packaging 'setuptools>=77.0.3,<81.0.0' wheel setuptools_scm regex build


rm -f /usr/lib/python3.*/EXTERNALLY-MANAGED and pip/setuptools installation and upgrading is already handled by the torch image, so you don't need to repeat those bits here.

Eta0 · 2026-05-15T18:20:27Z

+    apt-get install -y --no-install-recommends curl libsodium23 libnuma-dev && \
+    apt-get purge -y python3-jwt && \
+    apt-get clean && \
+    rm -f /usr/lib/python3.*/EXTERNALLY-MANAGED


Same comment as before: this rm is already handled by the base image.

Eta0 · 2026-05-15T18:20:35Z

-RUN apt-get -qq update && apt-get install -y --no-install-recommends curl libsodium23 libnuma-dev && apt-get clean
+RUN apt-get -qq update && \
+    apt-get install -y --no-install-recommends curl libsodium23 libnuma-dev && \
+    apt-get purge -y python3-jwt && \


What's that apt-get purge -y python3-jwt for? 👀

JustinPerlman added 12 commits May 11, 2026 10:41

feat(vllm-tensorizer): Upgrade vLLM to v0.20.2

1161cbf

feat(vllm-tensorizer): Upgrade vLLM to v0.20.2

d2d91eb

Merge branch 'jperlman/vllm0.20.2' of https://github.com/coreweave/ml…

1530550

…-containers into jperlman/vllm0.20.2

feat(vllm-tensorizer): Bump CUDA to 13.2

6ff9df5

fix(vllm-tensorizer): Use major.minor for cuda-python version spec

8bb3975

feat(vllm-tensorizer): Bump Ubuntu to 24.04

c6e25b4

fix(vllm-tensorizer): Fix pip install for Ubuntu 24.04

d20edb0

fix(vllm-tensorizer): Install wheel for lmcache-builder

3fc0de1

fix(vllm-tensorizer): Restore wheel to builder-base pip install

3a87962

fix(vllm-tensorizer): Remove EXTERNALLY-MANAGED in final image stage

81b22b2

fix(vllm-tensorizer): Purge system python3-jwt before pip installs

1305884

chore(vllm-tensorizer): Fix minor style inconsistencies

cc65ad3

JustinPerlman self-assigned this May 12, 2026

JustinPerlman requested a review from a team as a code owner May 12, 2026 19:30

JustinPerlman requested review from abatilo and ritazh May 12, 2026 19:47

abatilo reviewed May 12, 2026

View reviewed changes

JustinPerlman requested a review from Eta0 May 12, 2026 19:51

feat(vllm-tensorizer): Build matrix for CUDA 12.9 and CUDA 13.2

202ef09

alexeldeib approved these changes May 15, 2026

View reviewed changes

JustinPerlman merged commit 287015d into main May 15, 2026
9 checks passed

JustinPerlman deleted the jperlman/vllm0.20.2 branch May 15, 2026 15:53

Eta0 reviewed May 15, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(vllm-tensorizer): Bump vLLM to v0.20.2 on CUDA 13.2 / Ubuntu 24.04#160

feat(vllm-tensorizer): Bump vLLM to v0.20.2 on CUDA 13.2 / Ubuntu 24.04#160
JustinPerlman merged 13 commits into
mainfrom
jperlman/vllm0.20.2

JustinPerlman commented May 12, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 12, 2026

Uh oh!

abatilo left a comment

Uh oh!

JustinPerlman commented May 12, 2026 •

edited

Loading

Uh oh!

alexeldeib commented May 12, 2026

Uh oh!

alexeldeib commented May 12, 2026

Uh oh!

github-actions Bot commented May 15, 2026

Uh oh!

github-actions Bot commented May 15, 2026

Uh oh!

alexeldeib left a comment

Uh oh!

Uh oh!

Eta0 May 15, 2026

Uh oh!

Eta0 May 15, 2026

Uh oh!

Eta0 May 15, 2026 •

edited

Loading

Uh oh!

Eta0 May 15, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		rm -f /usr/lib/python3.*/EXTERNALLY-MANAGED && \
		python3 -m pip install -U --no-cache-dir pip packaging 'setuptools>=77.0.3,<81.0.0' wheel setuptools_scm regex build

Conversation

JustinPerlman commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Ubuntu 24.04 compatibility fixes

Uh oh!

github-actions Bot commented May 12, 2026

Uh oh!

abatilo left a comment

Choose a reason for hiding this comment

Uh oh!

JustinPerlman commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alexeldeib commented May 12, 2026

Uh oh!

alexeldeib commented May 12, 2026

Uh oh!

github-actions Bot commented May 15, 2026

Uh oh!

github-actions Bot commented May 15, 2026

Uh oh!

alexeldeib left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Eta0 May 15, 2026

Choose a reason for hiding this comment

Uh oh!

Eta0 May 15, 2026

Choose a reason for hiding this comment

Uh oh!

Eta0 May 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Eta0 May 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

JustinPerlman commented May 12, 2026 •

edited

Loading

JustinPerlman commented May 12, 2026 •

edited

Loading

Eta0 May 15, 2026 •

edited

Loading

Eta0 May 15, 2026 •

edited

Loading